Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition

Authors

Rivarol Vergin

Douglas D. O'Shaughnessy

Azarshid Farhat

Abstract

The focus of a continuous speech recognition process is to match an input signal with a set of words or sentences according to some optimality criteria. The first step of this process is parameterization, whose major task is data reduction by converting the input signal into parameters while preserving virtually all of the speech signal information dealing with the text message. This contribution presents a detailed analysis of a widely used set of parameters, the mel frequency cepstral coefficients (MFCCs), and suggests a new parameterization approach that takes into account the whole energy zone in the spectrum. Results obtained with the proposed coefficients give confidence in their use in a large-vocabulary speaker-independent continuous-speech recognition system.
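For orientation, the conventional MFCC parameterization that the abstract takes as its starting point can be sketched as follows. This is a minimal illustration of the standard pipeline (window, power spectrum, triangular mel filterbank, log, cosine transform), not the generalized coefficients proposed in the paper, whose details are not given here. The mel formula 2595·log10(1 + f/700), the filter count, the number of cepstra, and all function names are assumptions chosen for the example.

```python
import numpy as np

def hz_to_mel(f):
    # One common mel-scale formula (an assumption; other variants exist).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters whose edges are equally spaced on the mel scale.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    hz_points = mel_to_hz(mel_points)
    freqs = np.arange(n_fft // 2 + 1) * sample_rate / n_fft
    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (freqs - left) / (center - left)
        falling = (right - freqs) / (right - center)
        bank[i] = np.maximum(0.0, np.minimum(rising, falling))
    return bank

def mfcc_frame(frame, sample_rate, n_filters=24, n_ceps=12):
    # Hypothetical helper: MFCCs c_1..c_n_ceps for a single frame of samples.
    n_fft = frame.size
    windowed = frame * np.hamming(n_fft)
    power = np.abs(np.fft.rfft(windowed)) ** 2
    energies = mel_filterbank(n_filters, n_fft, sample_rate) @ power
    log_energies = np.log(energies + 1e-10)  # small floor avoids log(0)
    # DCT-II basis, skipping c_0 (the overall log-energy term).
    k = np.arange(1, n_ceps + 1)[:, None]
    j = np.arange(n_filters)[None, :]
    basis = np.cos(np.pi * k * (j + 0.5) / n_filters)
    return basis @ log_energies
```

Because c_0 is omitted, the coefficients in this sketch are essentially insensitive to a constant gain applied to the frame: scaling the signal shifts every log filterbank energy by the same amount, and that shift projects only onto c_0.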

Similar resources

Building Medium-Vocabulary Isolated-Word Lithuanian HMM Speech Recognition System

In this paper, the opening work on the development of a Lithuanian HMM speech recognition system is described. The triphone single-Gaussian HMM speech recognition system based on Mel Frequency Cepstral Coefficients (MFCC) was developed using HTK toolkit. Hidden Markov model’s parameters were estimated from phone-level hand-annotated Lithuanian speech corpus. The system was evaluated on a speake...

Neuro Based Approach for Speech Recognition by Using Mel-frequency Cepstral Coefficients

R.L.K. Venkateswarlu (Department of Information Technology, Sasi Institute of Technology and Engineering, Tadepalligudem, India) and R. Vasanthakumari (Perunthalaivar Kamarajar Arts College, Puducherry-605107, India). This paper presents continu...

New Filter Structure based on Admissible Wavelet Packet Transform for Text-Independent Speaker Identification

Identical acoustic features like Mel frequency cepstral coefficients (MFCC) and linear predictive cepstral coefficients (LPCC) are widely used for different tasks like speech recognition and speaker recognition, whereas the requirements of speaker recognition differ from those of speech recognition. In MFCC feature representation, the Mel frequency scale is used to get a high resolutio...

Real Time Speech Recognition Using DSK TMS320C6713

Speech recognition is an important field of digital signal processing. The objective of Automatic Speaker Recognition (ASR) is to extract features and to characterize and recognize the speaker. Mel Frequency Cepstral Coefficients (MFCC) are the most widely used feature vector for ASR. MFCC is used for designing a text-dependent speaker identification system. In this paper the DSP processor TMS320C6713 with Code Comp...

Cepstrum derived from differentiated power spectrum for robust speech recognition

In this paper, cepstral features derived from the differential power spectrum (DPS) are proposed for improving the robustness of a speech recognizer in the presence of background noise. These robust features are computed from the speech signal of a given frame through the following four steps. First, the short-time power spectrum of the speech signal is computed through the fast ...

Journal:

IEEE Trans. Speech and Audio Processing

Volume 7, Issue

Pages -

Publication date: 1999
